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Abstract 

We study long-term growth-optimal strategies on a simple market with 
transaction costs. We show that several problems of this sort can be 
solved in closed form, and explicit the non-analytic dependance of optimal 
strategies and expected frictional losses of the friction parameter. 



1 Introduction 

The purpose of this paper is to present results on optimal control of invest- 
ment strategies with friction. The discussion is for the most part couched in 
the frame-work of growth-optimal portfolios of underlying securities, the prices 
of which are smooth diffusions. We will show how to treat such problems in 
a Fokker-Planck framework, as an alternative to the Hamilton-Jacobi-Bellman 
equation methodology used in financial economics ||l0| , ^ , and explicit that sev- 
eral problems of this sort may be solved analytically m closed form. It appears 
clearly in the Fokker-Planck formulation that optimal strategies and expected 
friction lossses typically have a singular dependence on the friction parameter. 
For underlying smooth diffusions the friction losses scale with friction parameter 
to the the fractional power of two thirds. We consider also the more general 
situation, where the underlying is not a smooth diffusion, generally limiting our- 
selves to the discrete time case, and show how to relate to the optimal growth 
rate to the eigenvalue of the proper evolution operator. 



The subject matter of this paper is not new, and the resuh that optimal con- 
trol of smooth diffusions with linear friction is singular is well known in control 
theory, see e.g. 0. We believe that juxtaposing derivations in terms of Fokker- 
Planck and Haimlton-Jacobi-Bellman may help to make results obtained in the 
second language more accessible to physicists, who are typically more conversant 
with the first. On the other hand, the Fokker-Planck approach has a certain 
simplicity, and facilitates order-of-magnitude discussions and the use of a kind 
of dimensional analysis. An original motivation for this work was to derive an 
option pricing procedure by using growth-optimal criteria, generalizing work in 
the friction-less case ||, This is technically a harder problem. Various ap- 
proximate solutions are possible and will be discussed in a forthcoming separate 
contribution Q]. 

2 Stocks, bonds and transaction costs 

In this section we derive the dynamics of a stock and bond portfolio with propor- 
tional transaction costs in discrete time, and then take the continuum limit. We 
adhere to a representation of the dynamics given by a pre-hedging prescription. 
Other representations are possible, and give rise to dynamical equations, which 
although equivalent in content may superficially look different. A discussion of 
other prescriptions will be presented separately We start by assuming that 
a source of of uncertainty is provided by the price movement of stock 



where time is discrete, St is the price of a share at time t and the Ut's are 
independent, identically distributed random variables. We also assume a risk- 
less security (a bond or a bank account), with price movement 



By a change of numeraire we can always take r equal to one. We then assume 
proportional transaction costs. That is, an investor who holds some capital C in 
stock or bonds at some time t, and would like to change his holding to the other 
security, would after the operation hold capital (1 — 7)C, where 7 is a non-zero 
number. Several interpretations of 7 arc possible. One is simply that there 
are direct fees coupled to every transaction. In real markets there are various 
fees, some proportional as we assume, some a fixed commission per transaction, 
and some with other structure such as different proportional commissions for 
buying and selling stock. For a real operator on a real market there are also 
indirect costs, such as overhead and operating costs, which, insofar as they can 
be determined from the outside, are perhaps roughly proportional to trading 
volume. Another interpretation of 7 is that there is really no one price, but 
only bid and ask prices, with some spread in between. If spread is proportional 
to price this will have the same effect as proportional friction. 

We consider at time t an investor who holds capital Wt, of which AW^ is 

invested in stock. The fraction invested in stock is hence It = The in- 

vestor would like to change the amount invested in stock to atWt, and when 
rebalancing his portfolio he has to pay transaction costs of 



St+At — utSt 



(1) 



Bt+At = rBt . 



(2) 



j\atWt-ltWt\ 



(3) 
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The total wealth of the investor before the market moves again is hence {1 — "flat — k) Wt, 
and after the market has moved at time (t+At) the wealth and fraction of wealth 
invested in stock are 



W(t+At) {ut+Atat + - at) - j\at - lt\)Wt (4) 

Ut+Atat 

Ht+At)at + (1 - at) - 7|at - h\ 



Ut+At^t ,J.^ 

'(t+At) - z „, , /I — — — zn. — M 



We see that the ItS can be considered as consecutive positions in a series of 
gambles, or in a repeated game against Nature. The control variables at are on 
the other hand strategy choices made at different times (and in different posi- 
tions) in the game. The game is multiplicative, since pay-off is proportional to 
capital, and it is Markov because the new capital and new position (M^(f+At) and 
l(t+At)) depend parametrically on the previous position {It). A simple special 
case is when transaction costs are zero and the dependence of wealth on previous 
position drops out. The game is then Bernoulli, and the only relevant variable 
is the strategy at- The optimal strategies according to the growth-optimal cri- 
terion in this class of models were determined by Kelly ||^. For recent papers 
and reviews with references to the later literature, see e.g. |^, |^, |^. 

The continuum limit of (jj) and ^) is reached by taking the random events 
to occur at times 0, dt, 2dt, . . ., ancTmodellling the stock price fluctuations by a 
stochastic differential 

Mi = 1 + fidt + adBt (6) 

dBt being increments of a Wiener process. Furthermore we assume that the 
control variable at is smoothly related to the position It by 

at = lt + Ptdt (7) 

The wealth fraction then changes as 

kt+dt) = lt + [(p-(7^h)lt(l-lt)+(3t+7lt\Pt\]dt + 

[altil - lt)]dBt + h.o.t (8) 



where we have taken dBt to be formally of size ^/dt and kept all terms up to 
linear in dt. Hence 

dlt = {{^i-a^lt)lt{l-lt)+f3t+lltm)dt + {altil -It)) dBt (9) 

The wealth process can be written down in a similar fashion, after applying Ito's 
lemma in a form equivalent to 

dlogWt = (^lilt - - l\Pt\^ dt + altdBt (10) 

It is also useful for further reference to observe that without control the stochas- 
tic differential equation (||) preserves the three intervals [—00, 0], [0, 1] and [1, 00]. 
A candidate for an optimal control will not move the process out of the best of 
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the three intervals. Assuming for definiteness that the best interval is [0, 1] we 
introduce the variable ^ = log and find 

d^t^[K + ^i-^a^]dt + adBt K= (l + e«) (/3(l + e-«)+7|A|) (11) 

The variable ^ ranges from — oo to +oo. Without control (/3 and k equal to 
zero) ^ follows a normal diffusion process with constant drift equal to /z — ^cr^. 
For long times without control the process therefore ends up with ^ far from 
the origin, that is I close to zero or one. If the best intervals in I are [— oo, 0] or 
[l,oo], the process without control in an analogous manner ends up in or 1, 
respectively. In other words, if you do not rehedge you eventually end with all 
your money in stock, or, depending on the parameters, with all your money in 
bonds. 



3 Long-term growth with transaction costs 

We start with some general remarks, borrowing some of the terminology from 
the theory of dynamical systems. Let us consider the dynamics of wealth and 
wealth fraction, where we write the strategy (at) as a function of position (It): 

W(^t+At) = {uit+At)atilt) + [1 - atih)) - llatih) - lt\) Wt (12) 

, ^ U{t+At)at{lt) , , 

Uit+AtMlt) + il-atilt))-l\atik)-k\ ^ ^ 

The dynamics in I induces a linear evolution equation for the probability distri- 
bution over the states in the corresponding Markov chain (i.e., the values of I). 
The asymptotic distribution in I is the eigenvector corresponding to the highest 
eigenvalue of the corresponding operator, let that be written Po(^; [a\)dl. The 
long-term growth rate is thus 

A(7;[«]) = I Po{l;[a])E[\og{l + a{l){u~l)~j\a{l)-l\)]dl (14) 

the expected logarithmic return with respect to the joint distribution over I 
and the random variable u Q Maximizing long-term growth is a non-trivial 
problem because the strategy a enters both in the function to be averaged, i.e. 
log(l -t- a{l){u — 1) — — and in the measure of integration. 

To proceed further we turn to the continuum limit, as defined by equations 
(M and ([lO|). We will consider a model problem, and show how that can be 
solved byTokker-Planck methods. Then we will repeat the model problem in 
the Hamilton-Jacobi-Bellman formalism, and then return to the full problem of 
equations (O) and (|lO|). A kind of dimensional analysis and possible extentions 
of the procedure arediscussed separately in section ^. 

^In our model / and u are independent, so the joint distribution is the product of the 
distributions over I and u separately. In a proper game one would expect that the u would 
not be independent of Z — and also dependent on the positions of the other players in the 
game. 
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3.1 A model problem 

Let us consider the stochastic differential equation 

'dV 



dxt 



dx 



dt 



2DdBf 



(15) 



and suppose that we wish to maximize 



U ^ E 



ax — —bx^ 



dV 

I — 

' dx 



(16) 



where the expectation value is taken with respect to the stationary distribution 
of X, 



E [f{x)] - / N-^e-^f{x)dx. 



(17) 



The normalization factor N is fixed by E[\] ~ 1. The problem is hence to 
maximize with Lagrange multiplier q the functional 



-bx^ 
2 



c\^\\dx 



o dx 



N 



(18) 



We vary with respect to V'{x) to find 



6V'{x) 



N- 



.Xl2Ll, 1 ^ 
D ( ) 

^ d' 



dV 



dx 



dV 



(19) 



At an extremum we have gy^^ = 0, for every x such that V{x) < oo. If 

we differentiate with respect to x two terms cancel, and we have the simpler 
equation 

ax - hx'^ + qN- 2Dc5{V'{x))V"{x) = 0. (20) 



Clearly it is not possible to satisfy ( po| ) unless V' = 0. If y = the equation 
can be understood in the distributional sense, for instance by broadening the 
S function to be 1/e on the interval [— e/2,e/2], and zero elsewhere. Then ( pO[ ) 
is equivalent to the ordinary differential equation V" = j^^*^^ ~ 5^^^ + 1-^) 
as long as V, the integral of V" , is not larger than e/2 in absolute value. This 
equation can obviously be satisfied on an interval in x of finite width. To find the 
optimal interval without making an explicit assumption on the regularization, 
we just assume that V is large outside an interval [a;_, a;+] and that V' is small 
in the same interval. The quantity to maximize can then be made arbitrary 
close to 



2 X4. — x^ 



b{x+l_ 
6 x+ 



(x_)3 



2Dc 



X- 



x^ 



(21) 
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which is maximized by 

a 1 /24:cD\ i , , 

The optimum control is thus given by a "deep-pit" potential with mid-point | 
and width (^^f^) • The value of the maximum is 

Urn..^^-^{9bD^c'y (23) 

The loss of utility is therefore proportional to friction parameter c to the power 
two thirds. 



3.2 Hamilton-Jacobi-Bellman 

Dynamical programming is formulated in the setting that we have a SDE 

d^t^ f{Ct,u)dt + a{(t,u)dBt, ^s=x (24) 

where u = u{x,t) is a dynamic control hmction. We wish to extremize a func- 
tional, generally of a form like 

J(x,s) = mf^|i;5,=,^ dtL{^uD+^t) + J{CT.T)^ (25) 

where the forward derivative is defined as 

D-Fi^,) = hmi.,... |^XW±^|O^Z(M| . (26) 



The solutions of (24) are assumed to be smooth diffusions for any admissi- 
ble choice of u. In our model problem there is no control of the diffusion 



term, i.e. a{^t,u) — \/2D does not depend on u. The drift term /(^t,u) is 

dV 
dx 



on the other hand — and we may also identify with /. Hence, if 



H^uD+^t) = -a^t + + c\D+^t\ and J{^t,T) = 0, then is the min- 

imum of the average value of the functional L which can be achieved in the 
time interval [s,T], given that the process starts in point x at time s. If the 



process (24) is stationary, then the large T limit of this average will be almost 
surely opposite in sig n but equal in absolute value of the expectation value to 
be maximized in (Rw. 

The idea is to recast the problem in terms of a differential equation. Bellman's 
principle states that for any h between and T — s 

J{x, s) = mf I E^^=, ^ ^ dtL{^t,D+^t) + J{is+h,s + h) I (27) 
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With the assumptions that the process ( |2^ ) is a smooth diffusion (already made), 
and that J(x, s) is a smooth function of x and s (which follows if (p3) is a smooth 
diffusion and s less than T), we have 



1 r"'^ 

= lim inf <^ - / dtL{^uD+^, 



= inf + + J(x, s) (28) 

uGK ( Z J 

Substituting L, cr(^t,u) and introducing A = — J we have 

dsX + sup {/ d^X + Ddl\ + ax-^--c\f\} = Q 

f ^ 
X{x,T;T) = (29) 

Naive variation with respect to / leads to dxX = csign(/). This is problem- 
atical because the sign function is not invertable, and we therefore have no 
equation for the control. Furthermore, control entirely drops out of (P9|), and 
we have instead the much simpler equation dgX + D d^X + ax — | = 0, with so- 
lution A(a;, s) = (T — s){ax — ^x^) — The average A/ (T — s) decreases 
asymptotically linearly as —Dh{T— s)/2 with increasing T. In fact, all we have 
solved for is the expected value of of an uncontrolled random walk 
with diffusion coefRcient D starting from x and moving for a time T — s. 

The problem is that motion in a "deep-pit" potential, which we already know 
is the asymptotically optimal control, is not a smooth diffusion at the bound- 
ary. For a full discussion, see e.g. to extend (slightly) the Hamilton- Jacobi- 
Bellman in a less rigorous manner, assume an interval [a:;_,a:_|_] such that the 
restoring force / goes to positive and negative infinity, respectively, when x 
tends to the boundary points x^ and x^ from the inside. Outside the interval 
/ is supposed to remain infinite, so that the process is constrained to be in 
[a;_,a;+]. If further the variational equation A = csign(/) is interpreted in 
the distributional sense, for instance by broading the sign function to tanh 
then / is determined as a function of dxX inside the interval, of size e away from 
the boundaries, and A satisfies the boundary conditions 

dxX\x=x- = {-dxX\x=x+) = c (30) 



The dynamic programming equation (g9|) can now be solved for large T, if we 
make the ansatz 

(T - s) 

X{x, s; T) h asymptotically constant in T (31) 

T 

When (|3|) holds true one integration in x in ( |2^ ) gives DdxX = A + ^x — ^x^ + 
^ and the two boundary conditions read 



1 flofoo /.I a o b ^ , , ^ 

A + -X- x_ + -xt = - M + -x+ xX + -x% = Dc (32) 

T 2 6 \T 2 6 
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which are easily separated into one equation for the constant of integration A, 
and one equation for r, which is identical to (|2l|). We have therefore derived the 
same result again in the HJB formalism. As a consistency check for the solution 
one verifies that in the interval [x-, a;+] the absolute value of the gradient of A 
is bounded by c. 



3.3 The full stock and bond problem 

It is convenient to consider the problem in terms of the transformed variable 
^ of (pi]), which obeys a stochastic differential equation entirely similar to the 
model problem (15) with —dxV = {k + fi — (T^/2) and D = tT^/2, but where the 
function to maximize is 



F ^ E 



-cr 



l + e«y 2 V(l + e^)V 1 + e« 1 + e«(l +7sign(K)) 



(33) 



In deriving (^) we have expressed the control /3 (of I) in terms of the control k 
(of ^). Let us write F = E[g{x,u)\u=d^v]- The equation analogous to ( pO| ) will 
then be 

d^Vd^g IdVdg d'g 1 

-7r-Y-r^\u=d^v - —^-^\u=d^v + -r-r\u=d^v + j^ig - qN) ^ (34) 

ax du D ox du dxdu D 

This equation will not have solutions (when V < oo) unless k vanishes. The 
invariant density is hence 

P(e) - 7V-iexp(^^i-^) e € (35) 

and the two endpoints ^_ and ^+ can be determined by maximizing 



l + e«y 2 V(l + e9^ 



2 iV V l + e«- l + e«-(l + 7) 

^2 7vlv l + e?+ l + eC+(l~7)y' ^"^^^ 

In the small 7 limit it is sufficient to observe that the optimal interval is centered 
on Z = -i^, and that the width is determined by (H) with D ^ -^f , 

b = and c = j. Inserting these quantities we have 



A/; 



(l27(ii)2(i_ii)2)^ 7 small (37) 



When /Lt/cr^ is close to zero or one the width of the holding interval vanishes. 
This is natural, since if the optimal investment is to hold all money in stock 
or bonds no rehedging is necessary. On the other hand, when /i/cr^ is around 
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one half the optimal holding interval is about 0.9 73. For realistic values of 
7 this is not a small number. For instance, if 7 is one percent, and the best 
portfolio without friction is to hold half each in stock and bonds, then the 
optimal strategy is to not rehedge unless the fraction in stock falls below 40 or 
rises above 60 percent. 



4 Dimensional analysis 

It is instructive to see that most of the results can be derived by a kind of 
dimensional analysis. Let us consider the model problem, and call the dimension 
of the quantity to maximize U. In the full problem it would be wealth per time. 
Let L be the dimension of the quantity x. The dimensions of the parameters 
are then 

[a] = j [b] = ^ [c] = ^ [D] = ^ (38) 

From a and b we can form one quantity with dimension L, namely a/b. This 
is the optimal value of x without friction. From b, D and c we can also form 
one quantity with dimension L, {Dc/b)^ ^ which, up the numerical prefactor, is 
indeed the width of the holding interval. 

From D and the width of the holding interval {d) we can form a character- 
istic time between rehedgings t = cP jD. If the growth-optimal criterion is to 
have a special significance for an individual investor we must suppose that his 
investment horizon is much longer than r. For realistic parameter values this 
implies surprisingly long time scales. If we use the previously discussed example 
with 7 = 0.01, and assume a one percent per day, then the characteristic time 
is 800 days, about three years of trading time. If we assume instead the extreme 
and probably unrealistic value of 7 one part in a million, then the holding in- 
terval is still one percent in width, and the characteristic time is measured in 
days. The conclusion is, that in the very simple model of one stock and one 
bond, with transaction costs, it is delicate and rather doubtful if times are ever 
sufficiently long that the growth-optimal criterion is unquestionably valid. If 
we enlarge the model to include more securities, especially derivative securities 
the return distributions of which depend explicitly on time to maturity, this 
conclusion can only be strengthened. For the growth-optimal criterion to be 
of special significance one would have to discuss not individual investors, but 
evolving populations of investment strategies, and to show, if true, that growth- 
optimal strategies in multiplicative games approximate non-cooperative Nash 
equilibria, selected by the Nash "mass-action" principle |l^. For a recent dis- 
cussion of populations of evolving strategies, see also @. 

We can also use dimensional analysis to consider sources of uncertainty other 
than smooth diffusions. If a noise term dwt is formally taken to be (dt)", then 

D will be of dimension L^/T^", the characteristic width d = D'^+^b~'^ 

and T — (yjj^ ■ The expected friction losses per unit time are cd/r. A simple 

strategy with this scaling is to bring back the portfolio to the optimum (cost 
cd) every time the fraction goes out of a band of width d (event occuring with 
frequency l/r). For smooth diffusion this is the same scaling as in (GS); only the 
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prefactor will be better with the optimal strategy. Since for realistic parameter 
values the characteristic times are typically long, it however probably suffices 
that the noise is Gaussian and uncorrelated on those time scales for the optimal 
control of smooth diffusions that have mainly been discussed here to be relevant. 



Acknowledgments 

We thank P. Dimon, S. Ghirlanda, O. Hammarlid, M. van Hecke, A. Kozlov, 
G. Lacorata, A. Martin-L6f, G. Smirnov and K. Zyczkowski for discussions. 
E.A. thanks Prof. S.M. Gama and the Departamento do Matematica Aplicada 
of Universidade do Porto for hospitality. We thank the organizers of European 
Dynamics Days 1999 (Como June 20-23, 1999) and the European Physical So- 
ciety Conference "Applications of Physics to Financial Analysis" (Dublin July 
14-17, 1999) for opportunities to present this work, and the Swedish Natural 
Research Council and the European Community for financial support under 
grants M-AA/FU/MA 01778-333 (E.A.) and ERB4001GT962476 (P.M.-G.). 



References 

M. Akian, J-L. Menaldi and A. Sulem, "On an investment-consumption 
model with transaction costs", SIAM J. Control and Optim., 34 (1996) 
329-364. 

E. AureU, R. Baviera, O. Hammarlid, M. Serva and A. Vulpiani, (1998), "A 
general methodology to price and hedge derivatives in incomplete markets" , 
IJTAF [in press]. 

E. AureU, R. Baviera, O. Hammarlid, M. Serva and A. Vulpiani, "Gam- 
bling and Pricing o f Derivatives" , (1998), So cial Science Research Network 



Electronic Library (http://papers.ssrn.com/) 



E. AureU and P. Muratore-Ginanneschi (in preparation). 

J.Doyne Farmer "Market fo rce, ecology, and evolution" , Los Alamos e-Print 
Archive |adap-org/9812005| . 



W.H.Fleming and H.Mete Soner Controlled Markov Processes and Viscosity 
Solutions (Springer- Verlag Berlin, 1992). 

N. Hakanson and W. Ziemba, "Capital Growth Theory" , in Handbooks in 
OR & MS, Vol.9, eds. R. Jarrow et al. (Elsevier Science, 1995). 

Jr. J. L. Kelly, "A new interpretation of the Information Rate", Bell Syst. 
Tech. J. 35 (1956) 917. 

S. Maslov and Y.-C. Zhang, "Optimal Investment Strategy for Risky As- 
sets", Lnternational J. of Theoretical and Applied Finance 1 (1998) 377-388. 

R. Merton, Continuous- Time Finance (Blackwell, 1990). 

J. Nash, Non-cooperative games, Ph.D. thesis (Princeton University, 1950). 



10 



